Direct effects testing: a two-stage procedure to test for effect size and variable importance for correlated binary predictors and a binary response.
نویسندگان
چکیده
In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good predictive model in such cases is well studied. Less well understood is how to recover the 'true sparsity pattern', that is finding which predictors have direct effects on the response, and indicating the statistical significance of the results. Restricting attention to binary predictors and response, we study the recovery of the true sparsity pattern using a two-stage method that separates establishing the presence of effects from inferring their exact relationship with the predictors. Simulations and a real data application demonstrate that the method discriminates well between associations and direct effects. Comparisons with lasso-based methods demonstrate favourable performance of the proposed method.
منابع مشابه
Binary Regression With a Misclassified Response Variable in Diabetes Data
Objectives: The categorical data analysis is very important in statistics and medical sciences. When the binary response variable is misclassified, the results of fitting the model will be biased in estimating adjusted odds ratios. The present study aimed to use a method to detect and correct misclassification error in the response variable of Type 2 Diabetes Mellitus (T2DM), applying binary ...
متن کاملLogic regression and its application in predicting diseases
Regression is one of the most important statistical tools in data analysis and study of the relationship between predictive variables and the response variable. in most issues, regression models and decision tress only can show the main effects of predictor variables on the response and considering interactions between variables does not exceed of two way and ultimately three-way, due to co...
متن کاملCompressed Liquid Densities for Binary Mixtures at Temperatures from 280- 440K at Pressures up to 200 MPa
A method for predicting liquid densities of binary mixtures from heat of vaporization and liquid densityat boiling point temperature (ΔHvap and n b ρ ) as scaling constants, is presented. B2(T) follows a promisingcorresponding-states principle. Calculation of α(T) and b(T), the two other temperature-dependentconstants of the equation of state, are made possible by scaling. As a result ΔHvap and...
متن کاملExtension of Logic regression to Longitudinal data: Transition Logic Regression
Logic regression is a generalized regression and classification method that is able to make Boolean combinations as new predictive variables from the original binary variables. Logic regression was introduced for case control or cohort study with independent observations. Although in various studies, correlated observations occur due to different reasons, logic regression have not been studi...
متن کاملتعیین حجم نمونه در داده های دوتایی برای دو گروه مستقل در مطالعات پزشکی
Backgound and Objectives: Nowadats, it is very common to investigate the effect of a new method or medicine by using a comparison between two independent groups in many clinical studies, but unfortunately the sample size part has not been done based on scientific formulas but on practical issues in some researches of this type of cinical studies, and this makes some unreliable results. Therefor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Statistics in medicine
دوره 29 24 شماره
صفحات -
تاریخ انتشار 2010